Starting Small - Learning with Adaptive Sample Sizes

Authors

  • Hadi Daneshmand
  • Aurélien Lucchi
  • Thomas Hofmann
Abstract

For many machine learning problems, data is abundant and it may be prohibitive to make multiple passes through the full training set. In this context, we investigate strategies for dynamically increasing the effective sample size, when using iterative methods such as stochastic gradient descent. Our interest is motivated by the rise of variance-reduced methods, which achieve linear convergence rates that scale favorably for smaller sample sizes. Exploiting this feature, we show – theoretically and empirically – how to obtain significant speed-ups with a novel algorithm that reaches statistical accuracy on an n-sample in 2n, instead of n log n steps.
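The idea of growing the effective sample size during optimization can be illustrated with a minimal sketch. It is not the algorithm from the paper: it uses plain SGD rather than a variance-reduced method, a one-dimensional least-squares model, and a hypothetical schedule that adds one new data point every second step, so that the full n-sample is active after about 2n updates. The names `adaptive_sample_sgd` and `step_budgets` are assumptions introduced for illustration only.

```python
import math
import random


def adaptive_sample_sgd(xs, ys, step=0.1, seed=0):
    """Fit y ~ w * x by SGD while the active sample grows from 1 to len(xs).

    Assumed schedule (illustrative, not taken from the paper): the active
    prefix gains one point every second step, so all n points are in use
    by the end of 2n updates.
    """
    rng = random.Random(seed)
    n = len(xs)
    w = 0.0
    for t in range(2 * n):
        m = min(n, 1 + t // 2)   # current effective sample size
        i = rng.randrange(m)     # draw only from the first m points
        # Gradient of 0.5 * (w * x_i - y_i)^2 with respect to w.
        w -= step * (w * xs[i] - ys[i]) * xs[i]
    return w


def step_budgets(n):
    """Return the two step counts contrasted in the abstract: (2n, n log n)."""
    return 2 * n, n * math.log(n)


if __name__ == "__main__":
    data_rng = random.Random(1)
    xs = [data_rng.uniform(-1.0, 1.0) for _ in range(2000)]
    ys = [3.0 * x + data_rng.gauss(0.0, 0.01) for x in xs]
    print("estimated slope:", adaptive_sample_sgd(xs, ys))
    print("step budgets for n = 2000:", step_budgets(2000))
```

Early updates touch only a handful of points, which is where the abstract's observation about favorable behavior at small sample sizes would apply; the sketch only demonstrates the schedule, not the convergence guarantee.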

Similar resources

Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach

Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...

Suitable Starting of a Small Teaching Group

Using small groups in teaching and learning has a long history. Evidence suggests that by applying some defined guidelines, it would be possible to reinforce the effectiveness of small-group teaching. One of these guidelines relates to ice-breaker activities, which facilitators apply at the start of the session and which are noticeably significant. The aim of the presen...

Economic Statistical Design of Multivariate T^2 Control Chart with Variable Sample Sizes

Today, quality improvement and cost reduction are key factors for achieving business success, growth and position. One of the primary tools for quality improvement and cost reduction in online activities of statistical process control is control charts. As the need for monitoring several correlated quality characteristics is extensively growing, the use of multivariate control charts become...

Adaptive Quaternion Attitude Control of Aerodynamic Flight Control Vehicles

Conventional quaternion-based methods have been extensively employed for spacecraft attitude control where the aerodynamic forces can be neglected. In the presence of aerodynamic forces, the flight attitude control is more complicated due to aerodynamic moments and inertia uncertainties. In this paper, a robust neuro-adaptive quat...

Bayesian Inference for Spatial Beta Generalized Linear Mixed Models

In some applications, the response variable assumes values in the unit interval. The standard linear regression model is not appropriate for modelling this type of data because the normality assumption is not met. Alternatively, the beta regression model has been introduced to analyze such observations. A beta distribution represents a flexible density family on (0, 1) interval that covers symm...

Publication date: 2016